Search CORE

6 research outputs found

Mining Query Plans for Finding Candidate Queries and Sub-Queries for Materialized Views in BI Systems Without Cube Generation

Author: Deshpande Parag
Deshpande Srijay
Kshirsagar Amit
Thakare Atul
Publication venue: Institute of Informatics, Slovak Academy of Sciences
Publication date: 31/05/2019
Field of study

Materialized views are important for optimizing Business Intelligence (BI) systems when they are designed without data cubes. Selecting candidate queries from large number of queries for materialized views is a challenging task. Most of the work done in the past involves finding out frequent queries from the past workload and creating materialized views from such queries by either manually analyzing workload or using approximate string matching algorithms using query text. Most of the existing methods suggest complete queries but ignore query components such as sub queries for creation of materialized views. This paper presents a novel method to determine on which queries and query components materialized views can be created to optimize aggregate and join queries by mining database of query execution plans which are in the form of binary trees. The proposed algorithm showed significant improvement in terms of more number of optimized queries because it is using the execution plan tree of the query as a basis of selection of query to be optimized using materialized views rather than choosing query text which is used by traditional methods. For selecting a correct set of queries to be optimized using materialized views, the paper proposes efficient specialized frequent tree component mining algorithm with novel heuristics to prune search space. These frequent components are used to determine the possible set of candidate queries for creation of materialized views. Experimentation on standard, real and synthetic data sets, and also the theoretical basis, proved that the proposed method is able to optimize a large number of queries with less number of materialized views and showed a significant improvement in performance compared to traditional methods

Computing and Informatics (E-Journal - Institute of Informatics, SAS, Bratislava)

SynCLay: Interactive Synthesis of Histology Images from Bespoke Cellular Layouts

Author: Dawood Muhammad
Deshpande Srijay
Minhas Fayyaz
Rajpoot Nasir
Publication venue
Publication date: 28/12/2022
Field of study

Automated synthesis of histology images has several potential applications in computational pathology. However, no existing method can generate realistic tissue images with a bespoke cellular layout or user-defined histology parameters. In this work, we propose a novel framework called SynCLay (Synthesis from Cellular Layouts) that can construct realistic and high-quality histology images from user-defined cellular layouts along with annotated cellular boundaries. Tissue image generation based on bespoke cellular layouts through the proposed framework allows users to generate different histological patterns from arbitrary topological arrangement of different types of cells. SynCLay generated synthetic images can be helpful in studying the role of different types of cells present in the tumor microenvironmet. Additionally, they can assist in balancing the distribution of cellular counts in tissue images for designing accurate cellular composition predictors by minimizing the effects of data imbalance. We train SynCLay in an adversarial manner and integrate a nuclear segmentation and classification model in its training to refine nuclear structures and generate nuclear masks in conjunction with synthetic images. During inference, we combine the model with another parametric model for generating colon images and associated cellular counts as annotations given the grade of differentiation and cell densities of different cells. We assess the generated images quantitatively and report on feedback from trained pathologists who assigned realism scores to a set of images generated by the framework. The average realism score across all pathologists for synthetic images was as high as that for the real images. We also show that augmenting limited real data with the synthetic data generated by our framework can significantly boost prediction performance of the cellular composition prediction task

arXiv.org e-Print Archive

Warwick Research Archives Portal Repository

TIAToolbox as an end-to-end library for advanced tissue image analytics

Author: Bashir Raja Muhammad Saad
Bilal Mohsin
Deshpande Srijay
Epstein D. B. A.
Graham Simon
Hadjigeorghiou Giorgos
Jahanifar Mostafa
Lu Wenqi
Minhas Fayyaz ul Amir Afsar
Pocock Johnathan
Rajpoot Nasir M.
Raza Shan-e-Ahmed
Shephard Adam
Vu Quoc Dang
Publication venue: Nature Publishing Group UK
Publication date: 24/09/2022
Field of study

Background: Computational pathology has seen rapid growth in recent years, driven by advanced deep-learning algorithms. Due to the sheer size and complexity of multi-gigapixel whole-slide images, to the best of our knowledge, there is no open-source software library providing a generic end-to-end API for pathology image analysis using best practices. Most researchers have designed custom pipelines from the bottom up, restricting the development of advanced algorithms to specialist users. To help overcome this bottleneck, we present TIAToolbox, a Python toolbox designed to make computational pathology accessible to computational, biomedical, and clinical researchers. Methods: By creating modular and configurable components, we enable the implementation of computational pathology algorithms in a way that is easy to use, flexible and extensible. We consider common sub-tasks including reading whole slide image data, patch extraction, stain normalization and augmentation, model inference, and visualization. For each of these steps, we provide a user-friendly application programming interface for commonly used methods and models. Results: We demonstrate the use of the interface to construct a full computational pathology deep-learning pipeline. We show, with the help of examples, how state-of-the-art deep-learning algorithms can be reimplemented in a streamlined manner using our library with minimal effort. Conclusions: We provide a usable and adaptable library with efficient, cutting-edge, and unit-tested tools for data loading, pre-processing, model inference, post-processing, and visualization. This enables a range of users to easily build upon recent deep-learning developments in the computational pathology literature

PubMed Central

Warwick Research Archives Portal Repository

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

Author: Ahn Heeyoung
Aviles-Rivero Angelica I.
Azzuni Hussam
Bashir Raja Muhammad Saad
Baumann Elias
Blache Marie-Claire
Böhland Moritz
Campilho Aurélio
Cardoso Jaime S.
Cheng Jijun
Chien Hsiang-Chin
Costa Pedro
Dawood Muhammad
Deshpande Srijay
Devika R. G.
Dubey Yash
Dumbhare Pranay
Fang Zijie
Graham Simon
Han Chu
Hirsch Peter
Hong Chenyang
Hong Yiyu
Hrishikesh P. S.
Huang Banban
Jahanifar Mostafa
Jain Ayushi
Jamthikar Ankush
Jiji C. V.
Jung Hyun
Kainmueller Dagmar
Kasai Satoshi
Kim Soo-Hyung
Kondo Satoshi
Kwak Jin Tae
Lee Chia-Yen
Lee Taebum
Li Jiachen
Lin Chunhui
Lin Hong-Kun
Lin Zhifan
Liu Lihao
Liu Shuolin
Liu Zaiyi
Löffler Katharina
Mao Lijian
Meda Yughender
Miao Tianyi
Mikut Ralf
Minhas Fayyaz
Mishra Prakash
Neumann Oliver
Nunes João D.
Pan Xipeng
Phuse Vedant
Piégu Benoît
Puthussery Densen
Rajpoot Nasir M.
Raza Shan E. Ahmed
Reischl Markus
Ridzuan Muhammad
Rumberger Josef Lorenz
Scherr Tim
Schilling Marcel P.
Schmidt Uwe
Schönlieb Carola-Bibiane
Shephard Adam
Snead David
Talsania Dhairya
The CoNIC Challenge Consortium
Vernay Bertrand
Vo Vi Thi-Tuong
Vu Quoc Dang
Vuong Trinh Thi Le
Wang Ching-Ping
Wang Chixin
Wang Xiyue
Weigert Martin
Wu Min
Xiang Jinxi
Xu Min
Yang Sen
Yaqub Mohammad
Ying Weiqin
Zhang Jun
Zhang Liukun
Zhang Wenhua
Zhang Ye
Zhang Yongbing
Ziaei Dorsa
Publication venue
Publication date: 14/03/2023
Field of study

Nuclear detection, segmentation and morphometric profiling are essential in helping us further understand the relationship between histology and patient outcome. To drive innovation in this area, we setup a community-wide challenge using the largest available dataset of its kind to assess nuclear segmentation and cellular composition. Our challenge, named CoNIC, stimulated the development of reproducible algorithms for cellular recognition with real-time result inspection on public leaderboards. We conducted an extensive post-challenge analysis based on the top-performing models using 1,658 whole-slide images of colon tissue. With around 700 million detected nuclei per model, associated features were used for dysplasia grading and survival analysis, where we demonstrated that the challenge's improvement over the previous state-of-the-art led to significant boosts in downstream performance. Our findings also suggest that eosinophils and neutrophils play an important role in the tumour microevironment. We release challenge models and WSI-level results to foster the development of further methods for biomarker discovery

arXiv.org e-Print Archive

KITopen

SAFRON : Stitching Across the Frontier Network for generating colorectal cancer histology images

Author: Deshpande Srijay
Graham Simon
Minhas Fayyaz ul Amir Afsar
Rajpoot Nasir M. (Nasir Mahmood)
Publication venue: 'Elsevier BV'
Publication date: 01/04/2022
Field of study

Automated synthesis of histology images has several potential applications including the development of data-efficient deep learning algorithms. In the field of computational pathology, where histology images are large in size and visual context is crucial, synthesis of large high-resolution images via generative modeling is an important but challenging task due to memory and computational constraints. To address this challenge, we propose a novel framework called SAFRON (Stitching Across the FROntier Network) to construct realistic, large high-resolution tissue images conditioned on input tissue component masks. The main novelty in the framework is integration of stitching in its loss function which enables generation of images of arbitrarily large sizes after training on relatively small image patches while preserving morphological features with minimal boundary artifacts. We have used the proposed framework for generating, to the best of our knowledge, the largest-sized synthetic histology images to date (up to 11K×8K pixels). Compared to existing approaches, our framework is efficient in terms of the memory required for training and computations needed for synthesizing large high-resolution images. The quality of generated images was assessed quantitatively using Frechet Inception Distance as well as by 7 trained pathologists, who assigned a realism score to a set of images generated by SAFRON. The average realism score across all pathologists for synthetic images was as high as that of real images. We also show that training with additional synthetic data generated by SAFRON can significantly boost prediction performance of gland segmentation and cancer detection algorithms in colorectal cancer histology images. [Abstract copyright: Copyright © 2021. Published by Elsevier B.V.

Warwick Research Archives Portal Repository